PCT



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 15/12, C07K 14/47, C12N 5/10, 
C07K 16/18 



A1



(11) International Publication Number: WO 97/15667
(43) International Publication Date: 1 May 1997 (01.05.97)



(21) International Application Number: PCT/US96/17201

(22) International Filing Date: 25 October 1996 (25.10.96) 



(30) Priority Data:
60/007,015    25 October 1995 (25.10.95)    US



(71) Applicant (for all designated States except US): REGENERON
PHARMACEUTICALS, INC. [US/US]; 777 Old Saw Mill
River Road, Tarrytown, NY 10591 (US).

(72) Inventors; and
(75) Inventors/Applicants (for US only): DAVIS, Samuel [US/US];
Apartment B2, 332 W. 88th Street, New York, NY 10024
(US). GALE, Nicholas, W. [US/US]; Apartment 5, 155
Beacon Hill Road, Dobbs Ferry, NY 10522 (US).
YANCOPOULOS, George, D. [US/US]; 1519 Baptist Church
Road, Yorktown Heights, NY 10598 (US).

(74) Agents: KEMPLER, Gail, M. et al.; Regeneron Pharmaceuticals,
Inc., 777 Old Saw Mill River Road, Tarrytown, NY
10591 (US).



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR,
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
HU, IL, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, 
LT, LU, LV, MD, MG, MK, MN, MW, MX, NO, NZ, PL, 
PT, RO, RU, SD, SE, SG, SI, SK, TJ, TM, TR, TT, UA, 
UG, US, UZ, VN, ARIPO patent (KE, LS, MW, SD, SZ, 
UG), Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, 
TM), European patent (AT, BE, CH, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, 
BJ, CF, CG, CI, CM, GA, GN, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: BIOLOGICALLY ACTIVE EPH FAMILY LIGANDS 
(57) Abstract 

A novel ligand (Efl-6) that binds the Elk subfamily of Eph receptors is identified, and methods for making the soluble Efl-6 ligand
in biologically active form are described. A cDNA clone encoding this novel protein enables production of the recombinant protein, which
is useful to support neuronal and other Eph receptor-bearing cell populations.



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international
applications under the PCT.

AM  Armenia
AT  Austria
AU  Australia
BB  Barbados
BE  Belgium
BF  Burkina Faso
BG  Bulgaria
BJ  Benin
BR  Brazil
BY  Belarus
CA  Canada
CF  Central African Republic
CG  Congo
CH  Switzerland
CI  Côte d'Ivoire
CM  Cameroon
CN  China
CS  Czechoslovakia
CZ  Czech Republic
DE  Germany
DK  Denmark
EE  Estonia
ES  Spain
FI  Finland
FR  France
GA  Gabon
GB  United Kingdom
GE  Georgia
GN  Guinea
GR  Greece
HU  Hungary
IE  Ireland
IT  Italy
JP  Japan
KE  Kenya
KG  Kyrgystan
KP  Democratic People's Republic of Korea
KR  Republic of Korea
KZ  Kazakhstan
LI  Liechtenstein
LK  Sri Lanka
LR  Liberia
LT  Lithuania
LU  Luxembourg
LV  Latvia
MC  Monaco
MD  Republic of Moldova
MG  Madagascar
ML  Mali
MN  Mongolia
MR  Mauritania
MW  Malawi
MX  Mexico
NE  Niger
NL  Netherlands
NO  Norway
NZ  New Zealand
PL  Poland
PT  Portugal
RO  Romania
RU  Russian Federation
SD  Sudan
SE  Sweden
SG  Singapore
SI  Slovenia
SK  Slovakia
SN  Senegal
SZ  Swaziland
TD  Chad
TG  Togo
TJ  Tajikistan
TT  Trinidad and Tobago
UA  Ukraine
UG  Uganda
US  United States of America
UZ  Uzbekistan
VN  Viet Nam



WO 97/15667 PCT/US96/17201 


BIOLOGICALLY ACTIVE EPH FAMILY LIGANDS 

INTRODUCTION

The present invention provides for a novel ligand that binds
proteins belonging to the Eph subfamily of receptor-like protein
tyrosine kinases, such as the Elk receptor, and methods for making
soluble forms of this ligand that are biologically active.

BACKGROUND OF THE INVENTION 
The ability of polypeptide ligands to bind cells and thereby
elicit a phenotypic response such as cell growth, survival or
differentiation is often mediated through transmembrane tyrosine
kinases. The extracellular portion of each receptor tyrosine kinase
(RTK) is generally the most distinctive portion of the molecule, as it
provides the protein with its ligand-recognizing characteristic.
Binding of a ligand to the extracellular domain results in signal
transduction via an intracellular tyrosine kinase catalytic domain,
which transmits a biological signal to intracellular target proteins.
The particular array of sequence motifs of this cytoplasmic,
catalytic domain determines its access to potential kinase
substrates (Mohammadi, et al., 1990, Mol. Cell. Biol., 11:5068-5078;
Fantl, et al., 1992, Cell, 69:413-413).
RTKs appear to undergo dimerization or some related
conformational change following ligand binding (Schlessinger, J.,
1988, Trends Biochem. Sci. 13:443-447; Ullrich and Schlessinger,
1990, Cell, 61:203-212; Schlessinger and Ullrich, 1992, Neuron
9:383-391); molecular interactions between dimerizing cytoplasmic
domains lead to activation of kinase function. In some instances,
such as the growth factor platelet derived growth factor (PDGF), the
ligand is a dimer that binds two receptor molecules (Hart, et al.,
1988, Science, 240:1529-1531; Heldin, 1989, J. Biol. Chem.
264:8905-8912) while, for example, in the case of EGF, the ligand is
a monomer (Weber, et al., 1984, J. Biol. Chem., 259:14631-14636).
The tissue distribution of a particular tyrosine kinase
receptor within higher organisms provides relevant data as to the
biological function of the receptor. The tyrosine kinase receptors
for some growth and differentiation factors, such as fibroblast
growth factor (FGF), are widely expressed and therefore appear to
play some general role in tissue growth and maintenance. Members
of the Trk RTK family (Glass & Yancopoulos, 1993, Trends in Cell
Biol., 3:262-268) of receptors are more generally limited to cells of
the nervous system, and the Nerve Growth Factor family consisting
of NGF, BDNF, NT-3 and NT-4/5 (known as the neurotrophins), which
bind these receptors, promote the differentiation of diverse groups
of neurons in the brain and periphery (Lindsay, R. M., 1993, in
Neurotrophic Factors, S.E. Loughlin & J.H. Fallon, eds., pp. 257-284
(San Diego, CA: Academic Press)). The localization of one such Trk
family receptor, trkB, in tissue provided some insight into the
potential biological role of this receptor, as well as the ligands that
bind this receptor (referred to herein as cognates). Thus, for
example, in adult mice, trkB was found to be preferentially
expressed in brain tissue, although significant levels of trkB mRNAs
were also observed in lung, muscle, and ovaries. Further, trkB
transcripts were detected in mid and late gestation embryos. In situ
hybridization analysis of 14 and 18 day old mouse embryos indicated
that trkB transcripts were localized in the central and peripheral
nervous systems, including brain, spinal cord, spinal and cranial
ganglia, paravertebral trunk of the sympathetic nervous system and
various innervation pathways, suggesting that the trkB gene product
may be a receptor involved in neurogenesis and early neural
development as well as play a role in the adult nervous system.

The cellular environment in which an RTK is expressed may
influence the biological response exhibited upon binding of a ligand
to the receptor. Thus, for example, when a neuronal cell expressing
a Trk receptor is exposed to a neurotrophin which binds that
receptor, neuronal survival and differentiation results. When the
same receptor is expressed by a fibroblast, exposure to the
neurotrophin results in proliferation of the fibroblast (Glass, et al.,
1991, Cell 66:405-413). Thus, it appears that the extracellular
domain provides the determining factor as to the ligand specificity,
and once signal transduction is initiated the cellular environment
will determine the phenotypic outcome of that signal transduction.

A number of RTK families have been identified based on
sequence homologies in their intracellular domain. The receptor and
signal transduction pathways utilized by NGF involve the product of
the trk proto-oncogene (Kaplan et al., 1991, Nature 350:156-160;
Klein et al., 1991, Cell 65:189-197). Klein et al. (1989, EMBO J.
11:3701-3709) reported the isolation of trkB, which encodes a second
member of the tyrosine protein kinase family of receptors found to
be highly related to the human trk proto-oncogene. TrkB binds and
mediates the functional responses to BDNF, NT-4, and, to a lesser
extent, NT-3 (Squinto, et al., 1991, Cell 65:885-903; Ip, et al.,
1992, Proc. Natl. Acad. Sci. U.S.A. 89:3060-3064; Klein, et al., 1992,
Neuron, 8:947-956). At the amino acid level, the products of trk and
trkB were found to share 57 percent homology in their extracellular
regions, including 9 of the 11 cysteines present in trk. This
homology was found to increase to 88 percent within their
respective tyrosine kinase catalytic domains. The Trk gene family
has now been expanded to include the trkC locus, with NT-3 having
been identified as the preferred ligand for trkC (Lamballe, et al.,
1991, Cell 66:967-979; Valenzuela, et al. 1993, Neuron 10:963-974).

The Eph-related transmembrane tyrosine kinases comprise
the largest known family of receptor-like tyrosine kinases, with
many members displaying specific expression in the developing and
adult nervous system. Two novel members of the Eph RTK family,
termed Ehk (eph homology kinase) -1 and -2, were identified using a
polymerase chain reaction (PCR)-based screen of genes expressed in
brain (Maisonpierre, et al. 1993, Oncogene 8:3277-388). These genes
appear to be expressed exclusively in the nervous system, with Ehk-1
expression beginning early in neural development. Recently, a new
member of this group of related receptors, Ehk-3, has been cloned
(Valenzuela, et al. 1995, Oncogene 10:1573-1580).

The elk gene encodes a receptor-like protein-tyrosine kinase
that also belongs to the eph subfamily, and which is expressed
almost exclusively in the brain (and at lower levels in the testes)
(Letwin, et al. 1988, Oncogene 3:621-678; Lhotak, et al., 1991, Mol.
Cell. Biol. 11:2496-2502). Based on its expression profile, the Elk
receptor and its cognate ligand are expected to play a role in cell to
cell interactions in the nervous system. Other members of the Eph
family of receptors that fall within the same subclass as Elk include
the Nuk/Cek5, Hek2/Sek4 and Htk receptors (Brambilla and Klein,
1995, Mol. Cell. Neurosci., 6:487-495; Gale, et al., 1996, Neuron
17:9-19).

Unlike the Ehks and Elk receptors, the closely related Eck
receptor appears to function in a more pleiotropic manner; it has
been identified in neural, epithelial and skeletal tissues and it
appears to be involved in the gastrulation, craniofacial, and limb bud
sites of pattern formation in the mouse embryo (Ganju, et al. 1994,
Oncogene 9:1613-1624).

The identification of a large number of receptor tyrosine
kinases has far exceeded the identification of their cognate ligands.
At best, determination of the tissues in which such receptors are
expressed provides insight into the regulation of the growth,
proliferation and regeneration of cells in target tissues. Because
RTKs appear to mediate a number of important functions during
development, their cognate ligands will inevitably play a crucial role
in development.

Although a number of schemes have been devised for the
identification of cognate ligands for the many orphan receptors that
have been identified, very few such ligands have been identified, and
the ligands that have been identified to date appear to have no
activity other than the ability to bind their cognate receptor. For
example, International Publication Number WO 94/11020, published
on May 26, 1994, describes ligands that bind to the Eck receptor. In
particular, the ligand EBP (also known as B61) is described. However,
although binding of B61 to the Eck receptor is disclosed, no
biological activity is described. Similarly, despite the description
in PCT Publication Number WO 94/11384 (published May 26, 1994) of
a ligand that binds the Elk receptor, no biological activity was
observed, regardless of whether the ligand was presented as
membrane bound or in the form of an Fc dimer of the soluble ligand.

With respect to the Elk receptor, however, chimeric EGFR-Elk
receptors (having the extracellular domain of the EGFR fused to the
Elk cytoplasmic domain) have been used to demonstrate the
functional integrity (as measured by EGF-stimulated
autophosphorylation) of the enzymatic domain of this receptor
(Lhotak and Pawson, 1993, Mol. Cell. Biol. 13:7071-7079).

SUMMARY OF THE INVENTION 

The present invention provides for a novel polypeptide
ligand, designated as Efl-6, that binds to the Elk, Nuk/Cek5,
Hek2/Sek4, Htk, and Sek1 receptors on cells. More importantly, the
invention provides a means of making biologically active, soluble
forms of this ligand, which are useful in promoting a differential
function and/or influencing the phenotype, such as growth and/or
proliferation, of receptor bearing cells. The invention also provides
for nucleic acids encoding such polypeptide ligands, and both
prokaryotic and eukaryotic expression systems for producing such
proteins. The invention also provides for antibodies to these ligands.

According to the invention, soluble forms of the ligands
described herein may be used to promote biological responses in Elk,
Nuk/Cek5, Hek2/Sek4, Htk, and Sek1 receptor-expressing cells. In
particular, a general method is described herein which produces
"clustering" of ligands for eph-related receptors, which functions to
make otherwise inactive soluble ligands biologically active, or
which enhances the biological activity of ligands that, absent such
clustering, would have only low levels of biological activity.

The ligands described herein also have diagnostic utilities. 
In particular embodiments of the invention, methods of detecting 
aberrancies in their function or expression may be used in the 
diagnosis of neurological or other disorders. 

In other embodiments, manipulation of the interaction
between the ligands and their cognate receptor may be used in assay
systems designed to identify both agonists and antagonists of Eph
receptor ligands. Such agonists and antagonists may be developed
for use in the eventual treatment of neurological or other disorders.

BRIEF DESCRIPTION OF THE FIGURES

Figure 1. Nucleotide and encoded protein sequence of Efl-6. The
putative signal sequence is encoded by about nucleotide 202 to about
nucleotide 273. The coding region of the mature protein begins at
about nucleotide 274 and ends at about nucleotide 1224. The
deposited clone has an A at position 698. This change created an
amino acid change from Q (Gln) to R (Arg). The coding region for the
putative transmembrane domain is shown underlined. The amino acid
sequence of the encoded extracellular domain, which is encoded by
about nucleotide 274 to about nucleotide 873, is shown in bold
letters.
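
As an illustrative aid only, the approximate coordinates quoted in the Figure 1 legend can be checked for reading-frame consistency. The sketch below assumes the coordinates are 1-based and inclusive, an assumption not stated in the legend (which gives every position as "about"); the names SEGMENTS, segment_codons and summarize are hypothetical helpers introduced here. Whether the mature coding region count includes a terminal stop codon cannot be determined from the legend alone.

```python
# Approximate nucleotide coordinates quoted in the Figure 1 legend.
# Assumption: positions are 1-based and inclusive.
SEGMENTS = {
    "signal_sequence": (202, 273),
    "mature_coding_region": (274, 1224),
    "extracellular_domain": (274, 873),
}


def segment_length_nt(start, end):
    """Length in nucleotides of a 1-based, inclusive interval."""
    return end - start + 1


def segment_codons(start, end):
    """Number of whole codons in the interval.

    Raises ValueError if the interval length is not a multiple of three,
    which would indicate that the quoted coordinates are out of frame.
    """
    length = segment_length_nt(start, end)
    if length % 3 != 0:
        raise ValueError(f"interval {start}-{end} is not a whole number of codons")
    return length // 3


def summarize(segments=SEGMENTS):
    """Map each named segment to its length in codons."""
    return {name: segment_codons(*span) for name, span in segments.items()}


if __name__ == "__main__":
    for name, codons in summarize().items():
        print(f"{name}: {codons} codons")
```

Under the stated assumption, each quoted interval is a whole number of codons, so the three segments are mutually in frame.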

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides for a novel polypeptide
ligand that binds to the Elk receptor. The novel polypeptide ligand of
the present invention is also able to bind other members of the Elk
subclass of Eph receptors, including Nuk/Cek5, Hek2/Sek4 and Htk,
as well as the only receptor known to "cross subclasses", known as
Sek1 (Brambilla and Klein, 1995, Mol. Cell. Neurosci., 6:487-495;
Gale, et al., 1996, Neuron 17:9-19). Accordingly, as used herein, the
"Elk" receptor refers to Elk, as well as the above receptors known to
bind the Elk ligands.

The invention further provides a means of making
biologically active, soluble forms of the Efl-6 ligand, which are
useful in promoting a differential function and/or influencing the
phenotype, such as growth and/or proliferation, of receptor bearing
cells. The invention also provides for nucleic acids encoding such a
polypeptide ligand, and both prokaryotic and eukaryotic expression
systems for producing this protein. The invention also provides for
antibodies to this ligand.

The novel ligand described herein is designated as Efl (Eph
transmembrane tyrosine kinase family ligands)-6. A deposit
designated as pBluescript SK- encoding Efl-6 was made with the
American Type Culture Collection on October 19, 1995 and has
received accession number 97319.

According to the invention, soluble forms of the Elk ligand
(referred to herein as Efl-6) may be used to promote biological
responses in Elk receptor-expressing cells. In particular, a general
method is described herein which produces "clustering" of Efl-6
ligand, which functions to make otherwise inactive soluble ligand
biologically active, or which enhances the biological activity of the
ligand which, absent such clustering, would have only low levels of
biological activity.

The Efl-6 ligand described herein may also have diagnostic
utilities. In particular embodiments of the invention, methods of
detecting aberrancies in its function or expression may be used in
the diagnosis of neurological or other disorders. In other
embodiments, manipulation of the interaction between the ligand and
its cognate receptor may be used in the treatment of neurological or
other disorders.

When used herein, Efl-6 includes functionally equivalent
molecules in which amino acid residues are substituted for residues
within the sequence resulting in a silent change. For example, one or
more amino acid residues within the sequence can be substituted by
another amino acid of a similar polarity which acts as a functional
equivalent, resulting in a silent alteration. Substitutes for an amino
acid within the sequence may be selected from other members of the
class to which the amino acid belongs. For example, the nonpolar
(hydrophobic) amino acids include alanine, leucine, isoleucine,
valine, proline, phenylalanine, tryptophan and methionine. The polar
neutral amino acids include glycine, serine, threonine, cysteine,
tyrosine, asparagine, and glutamine. The positively charged (basic)
amino acids include arginine, lysine and histidine. The negatively
charged (acidic) amino acids include aspartic acid and glutamic acid.
Also included within the scope of the invention are proteins or
fragments or derivatives thereof which exhibit the same or similar
biological activity and derivatives which are differentially modified
during or after translation, e.g., by glycosylation, proteolytic
cleavage, linkage to an antibody molecule or other cellular ligand,
etc.
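
The four amino acid classes listed above can be written down as a small lookup, shown here as a minimal sketch rather than as any method described in this specification. The one-letter residue codes are the standard ones; the names AMINO_ACID_CLASSES, residue_class and is_same_class_substitution are hypothetical and introduced only for illustration. Membership in the same class is used as a simple proxy for a "silent" substitution, which is an assumption: functional equivalence would have to be confirmed experimentally.

```python
# Residue classes exactly as enumerated in the paragraph above,
# written with standard one-letter amino acid codes.
AMINO_ACID_CLASSES = {
    "nonpolar": set("ALIVPFWM"),      # Ala, Leu, Ile, Val, Pro, Phe, Trp, Met
    "polar_neutral": set("GSTCYNQ"),  # Gly, Ser, Thr, Cys, Tyr, Asn, Gln
    "basic": set("RKH"),              # Arg, Lys, His
    "acidic": set("DE"),              # Asp, Glu
}


def residue_class(residue):
    """Return the class name for a one-letter amino acid code."""
    code = residue.upper()
    for name, members in AMINO_ACID_CLASSES.items():
        if code in members:
            return name
    raise ValueError(f"unknown amino acid code: {residue!r}")


def is_same_class_substitution(original, substitute):
    """True if both residues belong to the same class listed above."""
    return residue_class(original) == residue_class(substitute)


if __name__ == "__main__":
    print(is_same_class_substitution("L", "I"))  # both nonpolar
    print(is_same_class_substitution("D", "K"))  # acidic versus basic
```

Under this grouping, a leucine to isoleucine replacement stays within one class, whereas a glutamine to arginine replacement crosses from the polar neutral class to the basic class.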

Cells that express Efl-6 may do so naturally or may be
genetically engineered to produce this ligand, as described supra, by
transfection, transduction, electroporation, microinjection, via a
transgenic animal, etc. of nucleic acid encoding Efl-6 described
herein in a suitable expression vector. A vector containing the cDNA
encoding Efl-6, deposited with the American Type Culture
Collection under the terms of the Budapest Treaty on October 19,
1995 as pBluescriptSK-Efl-6, has been given the ATCC designation
97319.

The present invention encompasses the DNA sequence
contained in the above deposited plasmid, as well as DNA and RNA
sequences that hybridize to the Efl-6 encoding sequence contained
therein, under conditions of moderate stringency, as defined in, for
example, Sambrook, et al. Molecular Cloning: A Laboratory Manual, 2nd
ed. Vol. 1, pp. 101-104, Cold Spring Harbor Laboratory Press (1989).
Thus, nucleic acids contemplated by the invention include the
sequence as contained in the deposit and as set forth in Figure 1,
sequences of nucleic acids that hybridize to such sequence and which
bind the Elk receptor, and nucleic acid sequences which are
degenerate of the above sequences as a result of the genetic code,
but which encode ligand(s) that bind the Elk receptor.
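
Degeneracy of the genetic code, referred to above, means that different nucleotide sequences can encode the same polypeptide. The following minimal sketch illustrates this with the standard genetic code. The short sequences in the example are invented for illustration and are not taken from Figure 1; the names CODON_TABLE, translate and encode_same_protein are hypothetical helpers.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G order
# (first base varies slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def encode_same_protein(dna_a, dna_b):
    """True if two coding sequences are degenerate variants of one another."""
    return translate(dna_a) == translate(dna_b)


if __name__ == "__main__":
    # Invented toy sequences: same peptide, different codon choices.
    print(translate("ATGCAGCGT"), translate("ATGCAACGC"))
    print(encode_same_protein("ATGCAGCGT", "ATGCAACGC"))
```

The two toy sequences differ at two nucleotide positions yet translate to the same tripeptide, which is the sense in which a sequence is "degenerate" of another.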

In addition, the present invention contemplates use of the
ligands described herein in soluble forms, truncated forms, and
tagged forms. This includes monomeric forms of the ligand which
may bind to the receptor and function as an antagonist.

Any of the methods known to one skilled in the art for the
insertion of DNA fragments into a vector may be used to construct
expression vectors encoding Efl-6 using appropriate
transcriptional/translational control signals and the protein coding
sequences. These methods may include in vitro recombinant DNA and
synthetic techniques and in vivo recombinations (genetic
recombination). Expression of nucleic acid sequence encoding the
Efl-6 or peptide fragments thereof may be regulated by a second
nucleic acid sequence so that the protein or peptide is expressed in a
host transformed with the recombinant DNA molecule. For example,
expression of the Efl-6 described herein may be controlled by any
promoter/enhancer element known in the art. Promoters which may
be used to control expression of the ligands include, but are not
limited to, the long terminal repeat as described in Squinto et al.,
(1991, Cell 65:1-20); the SV40 early promoter region (Bernoist and
Chambon, 1981, Nature 290:304-310), the CMV promoter, the M-MuLV
5' terminal repeat, the promoter contained in the 3' long terminal
repeat of Rous sarcoma virus (Yamamoto, et al., 1980, Cell
22:787-797), the herpes thymidine kinase promoter (Wagner et al., 1981,
Proc. Natl. Acad. Sci. U.S.A. 78:144-1445), the regulatory sequences
of the metallothionein gene (Brinster et al., 1982, Nature 296:39-42);
prokaryotic expression vectors such as the β-lactamase promoter
(Villa-Kamaroff, et al., 1978, Proc. Natl. Acad. Sci. U.S.A.
75:3727-3731), or the tac promoter (DeBoer, et al., 1983, Proc. Natl. Acad.
Sci. U.S.A. 80:21-25), see also "Useful proteins from recombinant
bacteria" in Scientific American, 1980, 242:74-94; promoter
elements from yeast or other fungi such as the Gal 4 promoter, the
ADH (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase)
promoter, alkaline phosphatase promoter, and the following animal
transcriptional control regions, which exhibit tissue specificity and
have been utilized in transgenic animals: elastase I gene control
region which is active in pancreatic acinar cells (Swift, et al., 1984,
Cell 38:639-646; Ornitz, et al., 1986, Cold Spring Harbor Symp.
Quant. Biol. 50:399-409; MacDonald, 1987, Hepatology 7:425-515);
insulin gene control region which is active in pancreatic beta cells
(Hanahan, 1985, Nature 315:115-122), immunoglobulin gene control
region which is active in lymphoid cells (Grosschedl, et al., 1984,
Cell 38:647-658; Adames, et al., 1985, Nature 318:533-538;
Alexander et al., 1987, Mol. Cell. Biol. 7:1436-1444), mouse
mammary tumor virus control region which is active in testicular,
breast, lymphoid and mast cells (Leder, et al., 1986, Cell
45:485-495), albumin gene control region which is active in liver (Pinkert,
et al., 1987, Genes and Devel. 1:268-276), alpha-fetoprotein gene
control region which is active in liver (Krumlauf, et al., 1985, Mol.
Cell. Biol. 5:1639-1648; Hammer et al., 1987, Science 235:53-58);
alpha 1-antitrypsin gene control region which is active in the liver
(Kelsey, et al., 1987, Genes and Devel. 1:161-171), beta-globin gene
control region which is active in myeloid cells (Mogram, et al., 1985,
Nature 315:338-340; Kollias, et al., 1986, Cell 46:89-94); myelin
basic protein gene control region which is active in oligodendrocyte
cells in the brain (Readhead, et al., 1987, Cell 48:703-712); myosin
light chain-2 gene control region which is active in skeletal muscle
(Shani, 1985, Nature 314:283-286), and gonadotropic releasing
hormone gene control region which is active in the hypothalamus
(Mason et al., 1986, Science 234:1372-1378).

Thus, according to the invention, expression vectors capable
of being replicated in a bacterial or eukaryotic host, comprising
Efl-6 encoding nucleic acid as described herein, are used to transfect
the host and thereby direct expression of such nucleic acid to
produce the Efl-6 proteins, which may then be recovered in
biologically active form. As used herein, a biologically active form
includes a form capable of binding to the relevant receptor, such as
Elk, and causing a differentiated function and/or influencing the
phenotype of the cell expressing the receptor. Such biologically
active forms would, for example, induce phosphorylation of the
tyrosine kinase domain of the Elk receptor, or stimulation of
synthesis of cellular DNA. Alternatively, biologically active Efl-6
ligand includes monomeric forms that bind the receptor and act as
antagonists.

Expression vectors containing the gene inserts can be
identified by three general approaches: (a) DNA-DNA hybridization,
(b) presence or absence of "marker" gene functions, and (c)
expression of inserted sequences. In the first approach, the
presence of a foreign gene inserted in an expression vector can be
detected by DNA-DNA hybridization using probes comprising
sequences that are homologous to an inserted efl-6 gene. In the
second approach, the recombinant vector/host system can be
identified and selected based upon the presence or absence of
certain "marker" gene functions (e.g., thymidine kinase activity,
resistance to antibiotics, transformation phenotype, occlusion body
formation in baculovirus, etc.) caused by the insertion of foreign
genes in the vector. For example, if the efl-6 gene is inserted
within the marker gene sequence of the vector, recombinants
containing the insert can be identified by the absence of the marker
gene function. In the third approach, recombinant expression vectors
can be identified by assaying the foreign gene product expressed by
the recombinant. Such assays can be based, for example, on the
physical or functional properties of the efl-6 gene product, for
example, by binding of the ligand to the Elk receptor or portion
thereof which may be tagged with, for example, a detectable
antibody or portion thereof or binding to antibodies produced against
the Efl-6 protein or a portion thereof.

Efl-6 appears to be a conventional transmembrane protein 
with a cytoplasmic domain. The transmembrane domain is shown 
underlined in Figure 1. Accordingly, the soluble or extracellular 
domain of the ligand (sEfl-6) is encoded by the nucleotide sequence 
from about nucleotide 274 to about nucleotide 873. 

The ligands described herein may be produced as membrane
bound forms in animal cell expression systems or may be expressed
in soluble form. Soluble forms of the ligands may be expressed using
methods known to those in the art. A commonly used strategy
involves use of oligonucleotide primers, one of which spans the
N-terminus of the protein, the other of which spans the region just
upstream to a hydrophobic segment of the protein, which represents
either the GPI-linkage recognition domain or a transmembrane
domain of the protein. The oligonucleotide spanning the C-terminus
region is modified so as to contain a stop codon prior to the
hydrophobic domain. The two oligonucleotides are used to amplify a
modified version of the gene encoding a protein that is secreted
instead of membrane bound. Alternatively, a convenient restriction
site in the vector can be used to insert an altered sequence that
removes the GPI-linkage recognition domain or transmembrane
domain, thus resulting in a vector capable of expressing a secreted
form of the protein. The soluble protein so produced would include
the region of the protein from the N-terminus to the region
preceding the hydrophobic GPI recognition domain or transmembrane
domain.
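
The outcome of the truncation strategy just described can be modeled in a few lines. This is a sketch of the resulting coding sequence only, not of primer design or of any protocol in this specification. It assumes an in-frame coding sequence beginning at the initiator codon and a known 0-based codon index at which the hydrophobic segment begins; the choice of TAA as the stop codon, the function name soluble_coding_sequence and the toy sequence are all hypothetical.

```python
STOP_CODON = "TAA"  # assumption: any of the three stop codons would serve


def soluble_coding_sequence(cds, hydrophobic_start_codon):
    """Model the amplified product encoding a secreted form of a protein.

    cds: in-frame coding sequence beginning at the initiator codon.
    hydrophobic_start_codon: 0-based index of the first codon of the
        hydrophobic (GPI-recognition or transmembrane) segment.

    Returns the sequence from the N-terminus up to, but not including,
    the hydrophobic segment, followed by a stop codon.
    """
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    n_codons = len(cds) // 3
    if not 0 < hydrophobic_start_codon <= n_codons:
        raise ValueError("hydrophobic segment start lies outside the sequence")
    return cds[: hydrophobic_start_codon * 3].upper() + STOP_CODON


if __name__ == "__main__":
    # Invented toy gene: Met, three Ala codons, four Leu codons standing in
    # for the hydrophobic segment, then the native stop codon.
    toy_cds = "ATG" + "GCT" * 3 + "CTG" * 4 + "TAA"
    print(soluble_coding_sequence(toy_cds, hydrophobic_start_codon=4))
```

In the toy example the four leucine codons standing in for the hydrophobic segment are dropped and a stop codon is placed directly after the last retained codon, mirroring the modified C-terminal oligonucleotide described above.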

Applicants have discovered that although the soluble ligands
produced according to the invention bind to the receptors in the eph
subfamily, such soluble ligands often have little or no biological
activity. Such soluble ligands are activated, according to the
present invention, by ligand "clustering". "Clustering" as used
herein refers to any method known to one skilled in the art for
creating multimers of the soluble portions of ligands described
herein.

In one embodiment, a "clustered" Efl-6 is a dimer, made, for
example, according to the present invention utilizing the Fc domain
of IgG (Aruffo, et al., 1991, Cell 67:35-44), which results in the
expression of the soluble ligand as a disulfide-linked homodimer. In
another embodiment, secreted forms of the ligand are constructed
with epitope tags at their C-termini; anti-tag antibodies are then
used to aggregate the ligands.

In addition, the invention contemplates other "engineered"
ligand molecules that exist as or form multimers. For example,
dimers of the extracellular domains may be engineered using leucine
zippers. The leucine zipper domains of the human transcription
factors c-jun and c-fos have been shown to form stable
heterodimers [Busch and Sassone-Corsi, Trends Genetics 6:36-40
(1990); Gentz, et al., Science 243:1695-1699 (1989)] with a 1:1
stoichiometry. Although jun-jun homodimers have also been shown
to form, they are about 1000-fold less stable than jun-fos
heterodimers. Fos-fos homodimers have not been detected. The
leucine zipper domain of either c-jun or c-fos is fused in frame at
the C-terminus of the soluble or extracellular domains of the above
mentioned ligands by genetically engineering chimeric genes. The
fusions may be direct or they may employ a flexible linker domain,
such as the hinge region of human IgG, or polypeptide linkers
consisting of small amino acids such as glycine, serine, threonine or
alanine, at various lengths and combinations. Additionally, the
chimeric proteins may be tagged by His-His-His-His-His-His (His6),
to allow rapid purification by metal-chelate chromatography, and/or
by epitopes to which antibodies are available, to allow for detection
on western blots, immunoprecipitation, or activity
depletion/blocking in bioassays.

Alternatively, multimers may be made by genetically
engineering and expressing molecules that consist of the soluble or
extracellular portion of the ligand followed by the Fc-domain of
hIgG, followed by either the c-jun or the c-fos leucine zippers
described above [Kostelny, et al., J. Immunol. 148:1547-1553
(1992)]. Since these leucine zippers form predominantly
heterodimers, they may be used to drive formation of the
heterodimers where desired. As for the chimeric proteins described
using leucine zippers, these may also be tagged with metal chelates
or an epitope. This tagged domain can be used for rapid purification
by metal-chelate chromatography, and/or by antibodies, to allow for
detection on western blots, immunoprecipitation, or activity
depletion/blocking in bioassays.

In another embodiment of the invention, multimeric soluble
ligands are prepared by expression as chimeric molecules utilizing
flexible linker loops. A DNA construct encoding the chimeric protein
is designed such that it expresses two or more soluble or
extracellular domains fused together in tandem ("head to head") by a
flexible loop. This loop may be entirely artificial (e.g. polyglycine
repeats interrupted by serine or threonine at a certain interval) or
"borrowed" from naturally occurring proteins (e.g. the hinge region of
hIgG). Molecules may be engineered in which the length and
composition of the loop is varied, to allow for selection of
molecules with desired characteristics. Although not wishing to be
bound by theory, applicants believe that membrane attachment of the
ligands facilitates ligand clustering, which in turn promotes
receptor multimerization and activation. Thus, according to the
invention, biological activity of the soluble ligand is achieved by
mimicking, in solution, membrane associated ligand clustering.
Thus, a biologically active, clustered soluble eph family ligand
comprises (soluble Efl)n, wherein the soluble efl is the extracellular
domain of a ligand that binds an eph family receptor and n is 2 or
greater. As described herein, Efl-6 is made biologically active
according to the process of the invention.
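
One way to picture varying the length and composition of an artificial loop is to enumerate a small panel of candidates. The Python sketch below is illustrative only: the function names `polyglycine_loop`, `tandem`, and `loop_panel` are hypothetical, the domain strings are placeholders, and nothing here predicts which loop would yield a biologically active molecule, which the text leaves to bioassay.

```python
from itertools import product


def polyglycine_loop(length: int, interval: int, breaker: str = "S") -> str:
    """Glycine run with `breaker` (Ser or Thr) at every `interval`-th position."""
    if breaker not in ("S", "T"):
        raise ValueError("breaker must be 'S' (serine) or 'T' (threonine)")
    if length < 0 or interval < 1:
        raise ValueError("length must be >= 0 and interval >= 1")
    return "".join(
        breaker if (i + 1) % interval == 0 else "G" for i in range(length)
    )


def tandem(domain: str, loop: str, n: int = 2) -> str:
    """Join n copies of a soluble domain with the loop, i.e. (soluble Efl)n."""
    if n < 2:
        raise ValueError("n must be 2 or greater")
    return loop.join([domain] * n)


def loop_panel(lengths, intervals, breakers=("S", "T")):
    """Enumerate candidate loops over every length/interval/breaker combination."""
    return {
        (length, interval, breaker): polyglycine_loop(length, interval, breaker)
        for length, interval, breaker in product(lengths, intervals, breakers)
    }
```

Calling `tandem(domain, loop, n)` for each loop returned by `loop_panel` gives one candidate construct per combination, mirroring the idea of building several molecules and then selecting among them.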

In each case, one skilled in the art will recognize that the 
success of clustering will require analysis of the biological activity 
utilizing bioassays such as those described herein. For example, 
receptor phosphorylation induced by stimulating receptor expressing 
reporter cells with COS cells overexpressing membrane forms of the 
ligands, soluble forms of the ligands and clustered ligands may be 
compared. 

Although in some instances dimerization of the ligand is 
sufficient to induce biological activity, in certain instances, the 
methods described herein are used to determine the sufficiency of a 
particular clustering technique. Often dimerization of a soluble 
ligand utilizing Fc appears to be insufficient for achieving a 
biological response, yet further clustering of the ligand according to 
the invention using anti-Fc antibodies may result in a substantial 
increase in biological activity. 

Cells of the present invention may transiently or, 
preferably, constitutively and permanently express Efl-6 in native 
form, or in soluble form as tagged Efl-6 or clustered Efl-6 as 
described herein. 

The recombinant factor may be purified by any technique
which allows for the subsequent formation of a stable, biologically
active protein. For example, and not by way of limitation, the factor
may be recovered from cells either as a soluble protein or as
inclusion bodies, from which it may be extracted quantitatively by
8M guanidinium hydrochloride and dialysis. In order to further purify
the factor, conventional ion exchange chromatography, hydrophobic
interaction chromatography, reverse phase chromatography or gel
filtration may be used.

In additional embodiments of the invention, recombinant
efl-6 may be used to inactivate or "knock out" the endogenous gene
by homologous recombination, and thereby create an Efl-6 protein
deficient cell, tissue, or animal. For example, and not by way of
limitation, recombinant efl may be engineered to contain an
insertional mutation, for example the neo gene, which would
inactivate the native efl-6 gene. Such a construct, under the
control of a suitable promoter, may be introduced into a cell, such as
an embryonic stem cell, by a technique such as transfection,
transduction, injection, etc. Cells containing the construct may then
be selected by G418 resistance. Cells which lack an intact efl-6
may then be identified, e.g. by Southern blotting or Northern blotting
or assay of expression. Cells lacking an intact efl-6 may then be
fused to early embryo cells to generate transgenic animals deficient
in such ligand. A comparison of such an animal with an animal
expressing endogenous Efl-6 would aid in the elucidation of the role
of the ligands in development and maintenance. Such an animal may
be used to define specific neuronal populations, or any other in vivo
processes, normally dependent upon the ligand.

The present invention also provides for antibodies to the
Efl-6 described herein which are useful for detection of the ligand
in, for example, diagnostic applications. Antibodies to the ligand
may also be useful for achieving clustering according to the
invention. In instances where endogenous ligand exists, the antibody
itself may act as the therapeutic by activating existing ligand.

For preparation of monoclonal antibodies directed toward
Efl-6, any technique which provides for the production of antibody
molecules by continuous cell lines in culture may be used. For
example, the hybridoma technique originally developed by Kohler and
Milstein (1975, Nature 256:495-497), as well as the trioma
technique, the human B-cell hybridoma technique (Kozbor, et al.,
1983, Immunology Today 4:72), and the EBV-hybridoma technique to
produce human monoclonal antibodies (Cole, et al., 1985, in
"Monoclonal Antibodies and Cancer Therapy," Alan R. Liss, Inc. pp. 77-
96) and the like are within the scope of the present invention.

The monoclonal antibodies for diagnostic or therapeutic use
may be human monoclonal antibodies or chimeric human-mouse (or
other species) monoclonal antibodies. Human monoclonal antibodies
may be made by any of numerous techniques known in the art (e.g.,
Teng, et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 80:7308-7312; Kozbor,
et al., 1983, Immunology Today 4:72-79; Olsson, et al., 1982, Meth.
Enzymol. 92:3-16). Chimeric antibody molecules may be prepared
containing a mouse antigen-binding domain with human constant
regions (Morrison et al., 1984, Proc. Natl. Acad. Sci. U.S.A. 81:6851;
Takeda et al., 1985, Nature 314:452).

Various procedures known in the art may be used for the
production of polyclonal antibodies to epitopes of the Efl-6
described herein. For the production of antibody, various host
animals can be immunized by injection with the Efl-6, or a fragment
or derivative thereof, including but not limited to rabbits, mice,
rats, etc. Various adjuvants may be used to increase the
immunological response, depending on the host species, and including
but not limited to Freund's (complete and incomplete), mineral gels
such as aluminum hydroxide, surface active substances such as
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions,
keyhole limpet hemocyanins, dinitrophenol, and potentially useful
human adjuvants such as BCG (Bacille Calmette-Guerin) and
Corynebacterium parvum.

A molecular clone of an antibody to a selected Efl-6 epitope 
can be prepared by known techniques. Recombinant DNA methodology 
(see e.g., Maniatis, et al., 1982, Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York) 
may be used to construct nucleic acid sequences which encode a 
monoclonal antibody molecule, or antigen binding region thereof. 

Antibody molecules may be purified by known techniques,
e.g., immunoabsorption or immunoaffinity chromatography,
chromatographic methods such as HPLC (high performance liquid
chromatography), or a combination thereof, etc.

The present invention provides for antibody molecules as
well as fragments of such antibody molecules. Antibody fragments
which contain the idiotype of the molecule can be generated by
known techniques. For example, such fragments include but are not
limited to: the F(ab')2 fragment which can be produced by pepsin
digestion of the antibody molecule; the Fab' fragments which can be
generated by reducing the disulfide bridges of the F(ab')2 fragment,
and the Fab fragments which can be generated by treating the
antibody molecule with papain and a reducing agent.

The present invention also provides for methods of treating
a patient suffering from a neurological disorder comprising treating
the patient with an effective amount of Efl-6, peptide fragments
thereof, or derivatives thereof capable of binding to Elk receptor.

The Elk receptor is also expressed primarily in brain.
Accordingly, it is believed that the Elk binding ligand described
herein will support the induction of a differential function and/or
influence the phenotype, such as growth and/or survival of neural
cells, expressing this receptor. As described in Gale, et al., 1996,
Oncogene 13:1343-1352, Efl-6 (described as Elk ligand 3 in the
reference) is notable for its remarkably restricted and prominent
expression in the floor plate and roof plate of the developing neural
tube and its rhombomere-specific expression in the developing
hindbrain. This distribution suggests a role of Efl-6 and its
reciprocal receptor in neuronal guidance and boundary formation,
critical features in the organization of the developing vertebrate
central nervous system.

The present invention also provides for pharmaceutical
compositions comprising the Efl-6 described herein, peptide
fragments thereof, or derivatives in a suitable pharmacologic
carrier.

The Efl-6 proteins, peptide fragments, or derivatives may
be administered systemically or locally. Any appropriate mode of
administration known in the art may be used, including, but not
limited to, intravenous, intrathecal, intraarterial, intranasal, oral,
subcutaneous, intraperitoneal, or by local injection or surgical
implant. Sustained release formulations are also provided for.

As our understanding of neurodegenerative
disease/neurotrauma becomes clearer, it may become apparent that
it would be beneficial to decrease the effect of endogenous Efl-6.
Therefore, in areas of nervous system trauma, it may be desirable to
provide Efl-6 antagonists, including, but not limited to, soluble
forms of Efl-6 which may compete with cell-bound ligand for
interaction with Elk receptor. Alternatively, soluble forms of the
Elk receptors (e.g. expressed as "receptorbodies" produced as
described herein) may act as antagonists by binding, and thereby
inactivating the ligand. It may be desirable to provide such
antagonists locally at the injury site rather than systemically. Use
of an Efl-6 antagonist-providing implant may be desirable.

Alternatively, certain conditions may benefit from an
increase in Efl-6 responsiveness. It may therefore be beneficial to
increase the number or binding affinity of Efl-6 in patients suffering
from such conditions. This could be achieved through gene therapy
using either Efl-6, Efl-6 expressing cells, or Elk receptor or
receptor chimeras (cells expressing the extracellular domain of the
Elk receptor). Selective expression of such recombinant proteins in
appropriate cells could be achieved using their encoding genes
controlled by tissue specific or inducible promoters or by producing
localized infection with replication defective viruses carrying the
recombinant genes.

The Efl-6 encoding DNA as deposited with the ATCC and
having accession number 97319 was isolated from a Stratagene (La
Jolla, California) human brain (frontal cortex) library (Catalogue No.
936212). The library is in the λZAPII vector. The sequence of the
Efl-6 coding region of this vector is set forth in Figure 1.
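
Figure 1 prints each stretch of sequence as a top strand, its base-by-base complement, and the amino acid translation. As a hedged illustration of how one row can be checked for internal consistency, the Python sketch below uses the first ten codons of the open reading frame as they appear in Figure 1 and the standard genetic code; the helper names `is_complementary` and `translate` are introduced here for illustration and are not part of the specification.

```python
# Standard genetic code, built compactly in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")

# First ten codons of the coding region as printed in Figure 1.
TOP = "".join("ATG GGG CCC CCC CAT TCT GGG CCG GGG GGC".split())
BOTTOM = "".join("TAC CCC GGG GGG GTA AGA CCC GGC CCC CCG".split())


def is_complementary(top: str, bottom: str) -> bool:
    """True if `bottom` is the base-by-base complement of `top`.

    The figure prints the lower strand directly beneath the upper one,
    so no reversal is applied.
    """
    return top.translate(COMPLEMENT) == bottom


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, one letter per complete codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(is_complementary(TOP, BOTTOM))
    print(translate(TOP))
```

The same two checks can be run on any other row of the figure. The lookup table only covers A, C, G and T, so positions printed with ambiguity codes such as N or R would need separate handling.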

Assays or purification of the Efl-6 protein may be
conducted by use of an Elk receptorbody, which consists of the
extracellular domain of Elk fused to the IgG1 constant region. This
receptorbody is prepared as follows: The Fc portion of human IgG1,
starting from the hinge region and extending to the carboxy terminus
of the molecule, was cloned from placental cDNA using PCR with
oligonucleotides corresponding to the published sequence of human
IgG1. Convenient restriction sites were also incorporated into the
oligonucleotides so as to allow cloning of the PCR fragment into an
expression vector. Expression vectors containing full length
receptors were modified either by restriction enzyme digests or by
PCR strategies so as to replace the transmembrane and intracellular
domains with restriction sites that allow cloning the human IgG1
fragment into these sites; this was done in such a way as to
generate a fusion protein with the receptor ectodomain as its
amino-terminus and the Fc portion of human IgG1 as its carboxy-
terminus. An alternative method of preparing receptorbodies is
described in Goodwin, et al., 1993, Cell 73:447-456.



DEPOSIT OF MICROORGANISMS 

The following vector has been deposited with the American Type
Culture Collection, 12301 Parklawn Drive, Rockville, Maryland
20852 in accordance with the Budapest Treaty.

DEPOSIT                  ACCESSION NUMBER

pBluescript SK-Efl-6     97319

The present invention is not to be limited in scope by the 
specific embodiments described herein. Indeed, various 
modifications of the invention in addition to those described herein 
will become apparent to those skilled in the art from the foregoing
description and accompanying figures. Such modifications are 
intended to fall within the scope of the appended claims. 

Various references are cited herein, the disclosures of 
which are incorporated by reference in their entireties. 

CLAIMS

1. An isolated and purified nucleic acid molecule encoding Efl-6
protein wherein the sequence of said nucleic acid is selected from
the group consisting of:

(a) the sequence of the DNA encoding mature Efl-6 protein
contained in the plasmid pBluescript SK-Efl-6 as deposited with the
American Type Culture Collection on October 19, 1995 and
designated as 97319;

(b) the sequence of the DNA encoding mature Efl-6 protein as
set forth in Figure 1;

(c) DNA sequences that hybridize under moderately stringent
conditions to the DNA of (a) or (b) and which encode a protein that
binds a receptor belonging to the Elk subfamily of Eph receptors; and

(d) DNA sequences that are degenerate as a result of the
genetic code to a DNA sequence of (a), (b), or (c) and which encode an
Efl-6 protein that binds a member of the Elk subclass of Eph
receptors.

2. Isolated and purified mature Efl-6 protein having an amino
acid sequence as set forth in Figure 1.

3. An isolated nucleic acid encoding the extracellular domain of
Efl-6 (sEfl-6) having a sequence selected from the following:

(a) the sequence set forth from about nucleotide 274 to about
nucleotide 873 of Figure 1; and

(b) a sequence which encodes the extracellular domain of Efl-6
as set forth in Figure 1.

4. Purified sEfl-6 encoded by the nucleotide sequence of claim 3.

5. (sEfl-6)n comprising the sEfl-6 protein according to claim 4,
wherein n is 2 or greater.

6. Efl-6 ligandbody comprising soluble Efl-6 protein according to
claim 4 and the Fc portion of IgG.

7. A vector which comprises a nucleic acid molecule of claim 1.

8. A vector according to claim 7 wherein the nucleic acid
molecule is operatively linked to an expression control sequence
capable of directing its expression in a host cell.

9. A host cell containing a vector according to claim 8.

10. A vector which comprises a nucleic acid molecule of claim 3.

11. A vector according to claim 10 wherein the nucleic acid
molecule is operatively linked to an expression control sequence
capable of directing its expression in a host cell.

12. A host cell containing a vector according to claim 11.

13. A method of producing Efl-6 ligand which comprises growing
cells of a host according to claim 8 under conditions permitting
production of the ligand, and recovering the ligand so produced.

14. A method of producing Efl-6 soluble ligand which comprises
growing cells of a host according to claim 11 under conditions
permitting production of the ligand, and recovering the ligand so
produced.

15. An antibody which specifically binds the ligand of claim 2 or
4.

16. An antibody according to claim 15 which is a monoclonal
antibody.

FIGURE 1



10 20 30 40 50 60 70 80 



GAATTCCCAC CCCGGGATCT GTGAGACTGA GCGCTCTGCC GCGGGGGCGC GGGCACAGCA GGAARCAGGT CCGCGTGGGC 
CTTAAGGGTG GGGCCCTAGA CACTCTGACT CGCGAGACGG CGCCCCCGCG CCCGTGTCGT CCTTYGTCCA GGCGCACCCG 

90 100 110 120 130 140 150 160 

GCTGGGGGCA TCAGCTACCG GGGTGGTCCG GGCTGAAGAG CCAGGCAGCC AAGGCAGCCA CCCCGGGGGG TGGGCGACTT 
CGACCCCCGT AGTCGATGGC CCCACCAGGC CCGACTTCTC GGTCCGTCGG TTCCGTCGGT GGGGCCCCCC ACCCGCTGAA 

170 180 190 200 210 220 230 



TGGGGGAGTT GGTGCCCCGC CCCCCAGGCC TTGGCGGGGT C ATG GGG CCC CCC CAT TCT GGG CCG GGG GGC 
ACCCCCTCAA CCACGGGGCG GGGGGTCCGG AACCGCCCCA G TAC CCC GGG GGG GTA AGA CCC GGC CCC CCG 

M G P P H S G P G G> 







240 250 260 270 280 290

GTG CGA GTC GGG GCC CTG CTG CTG CTG GGG GTT TTG GGG CTG GTG TCT GGG CTC AGC CTG GAG CCT
CAC GCT CAG CCC CGG GAC GAC GAC GAC CCC CAA AAC CCC GAC CAC AGA CCC GAG TCG GAC CTC GGA
V R V G A L L L L G V L G L V S G L S L E P>

300 310 320 330 340 350 360

GTC TAC TGG AAC TCG GCG AAT AAG AGG TTC CAG GCA GAG GGT GGT TAT GTG CTG TAC CCT CAG ATC
CAG ATG ACC TTG AGC CGC TTA TTC TCC AAG GTC CGT CTC CCA CCA ATA CAC GAC ATG GGA GTC TAG
V Y W N S A N K R F Q A E G G Y V L Y P Q I>

370 380 390 400 410 420

GGG GAC CGG CTA GAC CTG CTC TGC CCC CGG GCC CGG CCT CCT GGC CCT CAC TCC TCT CCT AAT TAT
CCC CTG GCC GAT CTG GAC GAG ACG GGG GCC CGG GCC GGA GGA CCG GGA GTG AGG AGA GGA TTA ATA
G D R L D L L C P R A R P P G P H S S P N Y>

430 440 450 460 470 480 490

GAG TTC TAC AAG CTG TAC CTG GTA GGG GGT GCT CAG GGC CGG CGC TGT GAG GCA CCC CCT GCC CCA
CTC AAG ATG TTC GAC ATG GAC CAT CCC CCA CGA GTC CCG GCC GCG ACA CTC CGT GGG GGA CGG GGT
E F Y K L Y L V G G A Q G R R C E A P P A P>

500 510 520 530 540 550 560

AAC CTC CTT CTC ACT TGT GAT CGC CCA GAC CTG GAT CTC CGC TTC ACC ATC AAG TTC CAG GAG TAT
TTG GAG GAA GAG TGA ACA CTA GCG GGT CTG GAC CTA GAG GCG AAG TGG TAG TTC AAG GTC CTC ATA
N L L L T C D R P D L D L R F T I K F Q E Y>

570 580 590 600 610 620

AGC CCT AAT CTC TGG GGC CAC GAG TTC CGC TCG CAC CAC GAT TAC TAC ATC ATT GCC ACA TCG GAT
TCG GGA TTA GAG ACC CCG GTG CTC AAG GCG AGC GTG GTG CTA ATG ATG TAG TAA CGG TGT AGC CTA
S P N L W G H E F R S H H D Y Y I I A T S D>

630 640 650 660 670 680 690

GGG ACC CGG GAG GGC CTG GAG AGC CTG CAG GGA GGT GTG TGC CTA ACC AGA GGC ATG AAG GTG CTT
CCC TGG GCC CTC CCG GAC CTC TCG GAC GTC CCT CCA CAC ACG GAT TGG TCT CCG TAC TTC CAC GAA
G T R E G L E S L Q G G V C L T R G M K V L>

700 710 720 730 740 750

CTC C(A/G)A GTG GGA CAA AGT CCC CGA GGA GGG GCT GTC CCC CGA AAA CCT GTG TCT GAA ATG CCC
GAG G(T/C)T CAC CCT GTT TCA GGG GCT CCT CCC CGA CAG GGG GCT TTT GGA CAC AGA CTT TAC GGG
L (Q/R) V G Q S P R G G A V P R K P V S E M P

760 770 780 790 800 810 820

GAA AGA GAC CGA GGG GCA GCC CAC AGC CTG GAG CCT GGG AAG GAG AAC CTG CCA GGT GAC CCC ACC
CTT TCT CTG GCT CCC CGT CGG GTG TCG GAC CTC GGA CCC TTC CTC TTG GAC GGT CCA CTG GGG TGG
E R D R G A A H S L E P G K E N L P G D P T>




FIGURE 1 (contd) 

[Figure 1 (contd): the remaining rows of the coding sequence, up to nucleotide 1229, are too fragmented in this copy to reconstruct.]

1230 1240 1250 1260 1270 1280 1290 1300 

TGA GGGCTC CTCTCACGTG GCTATCCTGA ATCCAGCCCT TCTTGGGGTG CTCCTCCAGT TTAATTCCTG GTTTGAGGGA 
ACT CCCGAG GAGAGTGCAC CGATAGGACT TAGGTCGGGA AGAACCCCAC GAGGAGGTCA AATTAAGGAC CAAACTCCCT 
*> 

1310 1320 1330 1340 1350 1360 1370 1380 

CACCTCTAAC ATCTCGGCCC CCTGTGCCCC CCCAGCCCCT TCACTCCTCC CGGCTGCTGT CCTCGTCTCC ACTTTTAGGA 
GTGGAGATTG TAGAGCCGGG GGACACGGGG GGGTCGGGGA AGTGAGGAGG GCCGACGACA GGAGCAGAGG TGAAAATCCT 

1390 1400 1410 1420 1430 1440 1450 1460 

TTCCTTAGGA TTCCCACTGC CCCACTTCCT GCCCTCCCGT TTGGCCATGG GTGCCCCCCT CTGTCTCAGT GTCCCTGGAT 
AAGGAATCCT AAGGGTGACG GGGTGAAGGA CGGGAGGGCA AACCGGTACC CACGGGGGGA GACAGAGTCA CAGGGACCTA 

1470 1480 1490 1500 1510 1520 1530 1540 

CCTTTTTCCT TGGGGAGGGG CACAGGCTCA GCCTCCTCTC TGACCATGAC CCAGGCATCC TTGTCCCCCT CACCCACCCA 
GGAAAAAGGA ACCCCTCCCC GTGTCCGAGT CGGAGGAGAG ACTGGTACTG GGTCCGTAGG AACAGGGGGA GTGGGTGGGT 

1550 1560 1570 1580 1590 1600 1610 1620 

GAGCTAGGGG CGGGAACAGC CCACCTTTTG GTTGGCACCG CCTTCTTTCT GCCTCTCACT GGTTTTCTCT TCTCTATCTC 
CTCGATCCCC GCCCTTGTCG GGTGGAAAAC CAACCGTGGC GGAAGAAAGA CGGAGAGTGA CCAAAAGAGA AGAGATAGAG 

1630 1640 1650 1660 1670 1680 1690 1700 

TTATTCTTTC CCTCTCTTCC GTCTCTAGGT CTGTTCTTCT TCCCTAGCAT CCTCCTCCCC ACATCTCCTT TCACCCTCTT 
AATAAGAAAG GGAGAGAAGG CAGAGATCCA GACAAGAAGA AGGGATCGTA GGAGGAGGGG TGTAGAGGAA AGTGGGAGAA 

1710 1720 1730 1740 1750 1760 1770 1780 

GGCTTCTTAT CCTGTGNCTC TCCCATCTCC TGGGTGGGGG NATCAAAGCA TTTCTCCCCT TAGCTTTCAG CCCCCTTCTG 
CCGAAGAATA GGACACNGAG AGGGTAGAGG ACCCACCCCC NTAGTTTCGT AAAGAGGGGA ATCGAAAGTC GGGGGAAGAC 

1790 1800 1810 1820 1830 1840 1850 1860 

ANCTCTCATA CCAANCACTC CCCTCAGTCT GTCAAAAATG GGGGGCTTAT GGGGAAGGGT CTGACAATCC ACCCCAGGTC 
TNGAGAGTAT GGTTNGTGAG GGGAGTCAGA CAGTTTTTAC CCCCCGAATA CCCCTTCCCA GACTGTTAGG TGGGGTCCAG 